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ABSTRACT 



In this report, a mathematical model of a data 
communications system using retransmission error 
control is developed. System performance is spec- 
ified by information throughput, the amount of error- 
free data transferred per unit of time. Information 
throughput is determined as a function of message 
length, taking into account transmission delay, chan- 
nel error rate, and overhead words in the data 
message. 
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SECTION I 
INTRODUCTION 

In digital data communications, error detection and retransmission is 
one method of obtaining essentially error-free transfer of data over a bursty 
channel between a data source and a remotely located data sink. A perform- 
ance measure of a data communications system is information throughput , 
the amount of error-free data transferred per unit of time. In this report, a 
previously developed mathematical model is extended to permit determina- 
tion of information throughput as a function of message length, taking into 

account transmission delay, channel error rate, and overhead words in the 

[2] 
data message. Kuhn , who previously examined this problem, did not 

include in his analysis the effects of transmission delay or overhead words. 

The data communications system, the mathematical model of the system 
for which the throughput is calculated, the error characteristics of the channel, 
and the graphical solutions to the throughput equation, which permits determina- 
tion of optimum message length for maximum throughput under changing 
operating parameters of channel error rate, delay, and message overhead, 
are described in this report. 
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section n 

THE DATA COMMUNICATIONS SYSTEM 

A functional block diagram of a typical data communications system 
is shown in Figure 1. A data source sends a sequence of digital information 
via a noisy channel to a remotely located sink. The data are encoded into 
information blocks called words. The data words are assembled into message 
blocks containing N words. A data word is considered in error if one or 
more errors appear in the received data word. A message is considered in 
error if one or more of the N words in the message are received in error. 

The sink is assumed to have the capability always to detect errors in 
the received data. This is a reasonable assumption since very powerful 
error-detecting codes are available. If a message in error is detected, the 
sink requests the source to go back in the data sequence to the message 
received in error and recommence transmission. The sink then ignores all 
incoming messages until the retransmitted data message is received cor- 
rectly. 

The time interval between the detection of a message in error and 
reception of the retransmission is the sum of the detection time and the 
round-trip transmission time of the communications system. The error- 
detection time is the time it takes for the sink to receive the message or 
N word intervals. The round-trip transmission time of the communica- 
tions system is D word intervals. 

Of the N words in each message, I words are overhead, i.e., they 
carry no information to the data sink. These I words are used to identify 



NOISE 



DATA 
SOURCE 



M CHANNEL 



ERROR 

DETECTING 
SINK 



Figure 1. Block Diagram of the Data Communications System 



the message, provide error detection, and maintain system synchronization. 
The remaining N-I words of the message contain the useful information 
transferred over the communications system. 



section m 

ERROR CHARACTERISTICS OF THE CHANNEL 

Error measurements on various communication media, such as high- 
frequency radio and troposcatter, indicate that these communications 
channels are bursty, i.e., the error occurrences are highly correlated. 

Attempts have been made to model these channels and to list their relevant 

[3 4] 
statistical characteristics in order to describe the behavior of the 

channel error patterns. 

A quantity commonly used to describe the channel is average word- 
error rate, P. This number is found by averaging the number of words in 
error in a fixed interval of the data stream over the total number of words 
in the interval. For example, if 10 words were in error in an interval of 10 4 
words, the average word-error rate would be 10 divided by 10 4 or 10~ 3 . 

For a channel with random, uncorrelated error patterns, average word- 
error rate is a good measure of transmission quality. For bursty channels, 
however, average word-error rate is not a meaningful measure unless 
additional higher order probabilities describing the random channel behavior 
are given. 

In this report, however, average word-error rate alone is used to 
illustrate the method. Higher order probability terms would vary the 
calculated results, but not the method employed. 



SECTION IV 
THE MATHEMATICAL MODEL 

GENERAL 

The data transmission system attempts to send messages over the 
channel. Errors occur in the messages at an average rate, P, and cause 
retransmissions which interrupt the flow of information. 

Expressions for the average length of a sequence of error-free data 
messages, the average number of retransmissions necessary to receive an 
error-free message, and the information throughput of the communications 
system are derived below. 

AVERAGE LENGTH OF AN ERROR-FREE SEQUENCE 

Let P equal 1-P, where P is the average word-error rate. From 
the theory of combinatorial analysis, it can be shown that the probability of a 
sequence of n words being transmitted without any errors, L(n), is given by 

L(n) = P P n . (1) 

The average sequence length, n, is given by 

00 

n = £ n L(n) . (2) 



n=0 



Substituting (1) into (2) gives 



n =£ n P P n . (3) 



n 
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Equation (3) reduces to 

P P P 

Therefore, the average length of the errorless sequence is P/P words. 

AVERAGE NUMBER OF RETRANSMISSIONS WITH ERRORS 

From the mathematics of combinatorial analysis, it can be shown that 
the probability of x retransmissions to receive an error -free retrans- 
mission K(x) is given by 

K(x) = P P X . (5) 

The average number of retransmissions to receive an error -free retrans- 
mission is given by 

00 

x = £ xK(x) . (6) 

x=0 

Substituting (5) into (6) yields 



x = P 

x=0 

Equation (7) is similar to Equation (3) and reduces to 



Y *p x . ( ? ) 



x = — . (8) 



P 
From Equation (4), therefore, 



1 

x = — 

n 



(9) 



TRANSMISSION OF N WORD MESSAGES 

When data are transmitted in groups of N words, all N words 
must be received without errors or a retransmission is requested. The 
probability of successful transmission P(N) is given by 

P(N) = P N . (10) 

The probability that the N word block contains at least one error, P(N), 
is given by 

P(N) = 1-P N . (11) 

This relationship is plotted in Figure 2, which shows P(N) as a function of 
N and P or 1-P. 

The average number of blocks transmitted without errors is represented 
by n^, and, in order to receive a message which is retransmitted without 
errors, the average number of retransmissions of N word blocks is rep- 
resented by x^j . The transmission with errors which initiated the retrans- 
mission sequence is not included in xl,. Substituting Equations (9) and (10) 
into Equations (4) and (8) yields 

pN 



and 



x XT = 1-P N (13) 

N ~^~ -. 
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Figure 2. Probability Error of N Word Messages P(N) Versus N and 
Word-Error Probability P 



INFORMATION THROUGHPUT 

Transmission of an error-free message results in N data words 
being accepted by the sink. An error-free sequence of n messages results 
in N n data words being accepted by the sink. A retransmission caused by 
the occurrence of an error and the termination of a sequence of errorless 
messages results in a transmission interruption of (D+ N) words, which is 
the sum of the round-trip transmission time and the error-detection time. 
The average interruption duration is (D + N) times the average number of 
times (1 + x) that a message must be transmitted and retransmitted in order 
to be received without errors. 

Information throughput, T , is defined as the amount of error-free 
information transferred per unit time. This quantity is the ratio of the 
average information transferred between retransmissions to the informa- 
tion which would be passed if there were no retransmissions. Thus, 

T Nn . (14) 

N 

Nn + (D+N) (1+x) 

Substituting Equations (11) and (12) into Equation (13) yields 

NP N /l-P N 

Tn = n(p n /i-p n ) + (d + n)[i + (i-p n /p n )] ' (15) 

Equation (14) may be rewritten as 

-2N 
N P 
T = — " " — — . (16) 



" N n(p 2N ) + (d + n) (i-p n ) 
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Since each N word message contains I overhead words, which 
results in a relative fraction, R, of noninformation-carrying words per 
message where 

R = 3j , (17) 

then the actual information throughput, T, is given by 

T = T N (1-R) . (18) 

Substituting Equations (15) and (16) into Equation (17) yields 



N P 2N [1 



- : /n) 



T = : (1Q\ 

NP 2N + ( D + n) (l-P N ) 

Equation (18) has been calculated by a computer using the values of P, 
N, D, and I shown in Table I. Figures 3 through 18 are plots of Equation 
(18). Each plot shows information throughput, T, versus message length, 
N, for various values of P and constant values of D and I. 

By inspecting Figures 3 through 18 it is obvious that for given 
values of P, D, and I there is a unique message length that maximizes 
the information throughput. Message lengths greater or less than the 
optimum will result in decreased throughput. Tables II through VI show 
the optimum message length to yield maximum throughput versus P 
for various values of D and I. 
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Table I 

Values of P, N, D, and I 
Used in Computer Solution of Equation (19) 



p 


N 


D 


I 


1 - 10" 2 


2 1 


10 


1 


1 - 10" 3 


2 s 


40 


2 


1 - 10~ 4 


2 3 


160 


4 


1 - 10~ 5 


2 4 


640 


8 


1 - 10" 6 


2 5 
2 6 
2 7 
2 s 

2 9 

2 io 

2 11 
2 ia 

2 13 

2 14 

2 15 
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of Error P 
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of Error P 
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Figure 5. Information Throughput T Versus Message Length N and Probability 
of Error P 



-14- 













TRANSMISSION DELAY(D) =10 WORDS OVERHEAD (I) = 8 WORDS 




1.0 


p =io- 6 




^^P^ICT 5 ""^v 




0.9 


/? P = I0"«\ \ 






0.8 


r\ \ \ 






0.7 


/^ P =I0" 3 \ \ \ 






0.6 


/ \ \ 






t 


/ \ \ 






■-0.5 


/ \ \ 






0.4 


- |a \ \ 






0.3 


- 1 \ \ \ 






0.2 


1/ P =I0" 2 \ \ \ 






0.1 


1 J 1 1 1 1 \ 1 1 1^*^J 1 1 X ( 






I 2 3 4 5 6 7 8 9 10 1 1 12 13 14 15 16 1' 


r 


— 1 


LO62 OF MESSAGE LENGTH N (WORDS) 




ID 
fO 






4 







Figure 6. Information Throughput T Versus Message Length N and Probability 
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Figure 7. Information Throughput T Versus Message Length N and Probability 
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Figure 8. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 9. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 10. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 11. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 12. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 13. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 14. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 15. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 17. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 18. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Table II 

Optimum Message Length in Words for 
F Equal to 1-10' 2 Versus D and I 



\ I 

D \ 


1 


2 


4 


8 


10 


8 


16 


16 


32 


40 


8 


16 


16 


32 


160 


8 


16 


16 


32 


640 


8 


16 


16 


32 



Table IE 

Optimum Message Length in Words for 
P Equal to 1-10" 3 Versus D and I 



\ I 

D \ 


1 


2 


4 


8 


10 


32 


32 


64 


64 


40 


32 


32 


64 


64 


160 


32 


32 


64 


64 


640 


32 


32 


64 


64 
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Table IV 



Optimum Message Length in Words for 



1-4 



P Equal to 1-10"* Versus D and I 



\I 


1 


2 


4 


8 


10 


128 


128 


256 


256 


40 


128 


128 


256 


256 


160 


128 


128 


256 


256 


640 


128 


128 


256 


256 



Table V 

Optimum Message Length in Words for 
P Equal to 1-10" 5 Versus D and I 



\ I 

D \^ 


1 


2 


4 


8 


10 


256 


512 


512 


1024 


40 


256 


512 


512 


1024 


160 


256 


512 


512 


1024 


640 


256 


512 


512 


1024 
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Table VI 

Optimum Message Length in Words for 
P Equal to 1-10" 6 Versus D and I 



v. I 
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10 


1024 


1024 


2048 


2048 


40 


1024 


1024 


2048 


2048 


160 


1024 


1024 


2048 


2048 


640 


1024 


1024 


2048 


2048 
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SECTION V 
CONCLUSIONS 

In this report, a mathematical model of a data communications 
system using retransmission error control was developed. Sjystem per- 
formance is specified by information throughput, the amount of error-free 
data transferred per unit of time. Information throughput, T, is determined 
as a function of message length, N, taking into account transmission delay, 
D, channel error rate, P, and overhead words in the data message, I. The 
throughput expression given in Equation (19) was plotted in Figures 3 through 
18 for various values of N, D, P, and I. 

It is obvious from inspecting the curves that throughput is dependent on 
error rate and message length. If the error rate is 10~ 5 or less, then mes- 
sage length is very close to optimum over a broad range of values. This 
optimum is also greater than 0.95. Lower rates than 10~ 5 would bring the 
optimum closer to 1.0. For example, a nominal error rate of wire line might 
be 10" . To attain a lower error rate would require forward acting error- 
correction codes which would add a fixed percent of redundancy, C, to each 
data word. Typical values of C range from 0.33 to 0.5. Since the attainable 
increase in throughput is only 0.05 or less, the use of these codes, however, 
would actually decrease the throughput. 

Typical optimum throughputs at an error rate of 10~ 2 , however, range 
from 0.75 to 0.05. If error-correction coding can decrease the error rate 
to 10~ 5 or better, a net gain in information throughput can result. 
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A study of the advantages of error-correction coding is just one 
example of the usefulness of the model presented in this report. Other uses 
will depend on the problems of the communications system designer. 
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